Find some good visual resources and the concept will be way easier to understand.
š These are the only two blogs I used initially and keep going back to for refreshing my memory on everything about attention, self-attention, transformers, and the whole foundation of todayās LLMs. I love how visual and interactive they are.
1. Sequence to Sequence (seq2seq) and Attention by Lena Voita: https://lnkd.in/g3vN8ZZ4
2. The Illustrated Transformer by Jay Alammar: https://lnkd.in/eJk-yamh (I'm sure anyone who's worked in NLP before the LLM frenzy has definitely come across this one)
š Read in the same order for better understanding
Added a few images from their blogs to give you a glimpse, but I highly recommend checking them out for the full experience!